Accelerating Reinforcement Learning with Suboptimal Guidance

Related resources

Accelerating Multi-agent Reinforcement Learning with Dynamic Co-learning

We introduce an approach to adaptively identify opportunities to periodically transfer experiences between agents in large-scale, stochastic, homogeneous, multi-agent systems. This algorithm operates in an on-line, distributed manner, using supervisor-directed transfer, leading to more rapid acquisition of appropriate policies in systems with a large number of cooperating reinforcement learning...

Accelerating Reinforcement Learning by Mirror Images

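Abstract (translated from the Japanese): In this study, we propose a method for improving learning speed on pursuit problems using Q-learning, a representative reinforcement learning method. The idea is to learn the Q-values of the field by exploiting mirror-image symmetry, which makes it possible to learn with only left-right symmetric differences. We also discuss convergence under simultaneous updates of Q-values. Original English abstract: In this investigation we propose how to accelerate Q-learning, which is one of the most successful reinforcement learning methods, using mirror images for hunting problems. Mirror images have symmetric differences on right and left views, th...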

Accelerating Reinforcement Learning through Implicit Imitation

Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent’s ability to learn useful behaviors by making intelligent use of the knowledge implicit in behaviors demonstrated by cooperative teachers or other more experienced agents. We propose and study a formal model of implicit imitation that can accelerate reinforcement learning dramatically in ce...

Accelerating Action Dependent Hierarchical Reinforcement Learning Through Autonomous Subgoal Discovery

This paper presents a new method for the autonomous construction of hierarchical action and state representations in reinforcement learning, aimed at accelerating learning and extending the scope of such systems. In this approach, the agent uses information acquired while learning one task to discover subgoals for similar tasks by analyzing the learned policy using Monte Carlo sampling. The age...

Shaping as a Method for Accelerating Reinforcement Learning

be facilitated by first learning to solve related simpler problems. The term "shaping" itself has been attributed to the psychologist Skinner [7], who used the technique to train animals such as rats and pigeons to perform complicated sequences of actions for rewards. Skinner describes how the technique is used to train pigeons to peck at a specific spot: We first give the bird food when it turns sli...

Journal

Journal title: IFAC-PapersOnLine

Year: 2020

ISSN: 2405-8963

DOI: 10.1016/j.ifacol.2020.12.2278